How Netflix Uses AI, Data Science, and Machine Learning

Today, I am talking about how Netflix uses AI, Data Science, and Machine Learning.

The presence of AI in today’s society is becoming more and more ubiquitous particularly as large companies like Netflix, Amazon, Facebook, Spotify, and many more continually deploy AI-related solutions that directly interact with consumers every day.

When properly applied to business problems, these AI-related solutions can provide really unique solutions that scale and improve over time, creating significant impact for both business and user. But what does it mean to “properly apply” an AI solution? Does that mean there is a wrong way? From a product perspective, the short answer is yes, and we’ll get to why that is later in this article as we dig deeper.

Overview: First, we will outline 5 use cases of

or machine learning at Netflix. We’ll then discuss some business needs vs technical considerations a Product Manager would look at. Then we will dive a little deeper into what is perhaps the most interesting of these 5 use cases as we identify what business problem it seeks to solve.

Five Use Cases of AI/Data/Machine Learning at Netflix :

Personalization of Movie Recommendations: Users who watch A are likely to watch B. This is perhaps the most well-known feature of Netflix. Netflix uses the watching history of other users with similar tastes to recommend what you may be most interested in watching next so that you stay engaged and continue your monthly subscription for more.
Auto-Generation and Personalization of Thumbnails: Using thousands of video frames from an existing movie or show as a starting point for thumbnail generation, Netflix annotates these images then ranks each image in an effort to identify which thumbnails have the highest likelihood of resulting in your click. These calculations are based on what others who are similar to you have clicked on. One finding could be that users who like certain actors/movie genres are more likely to click thumbnails with certain actors/image attributes.
Location Scouting for Movie Production: Using data to help decide on where and when best to shoot a movie set given constraints of schedule, budget, and production scene requirements. Notice this is more of a data science optimization problem rather than a machine learning model that makes predictions based on past data.
Movie Editing: Using historical data of when quality control checks have failed in the past to predict when a manual check is most beneficial in what could otherwise be a very time-intensive and laborious process.
Streaming Quality: Using past viewing data to predict bandwidth usage to help Netflix decide when to cache regional servers for faster load times during peak demand.

These 5 use cases of data science or machine learning just in Netflix alone have had such a scalable impact that they have forever changed the technology landscape and user experience for millions and more to come. Adoption of these AI-related solutions is only going to get stronger over time.

But before these use cases were as commonplace as they are today and used by users like you and me, someone, or some group within Netflix properly connected these AI solutions with a business need. Without this business link, these use cases would simply be pie in the sky ideas sitting at the bottom of a backlog like so many other great ideas. Only through proper positioning and connection with Netflix’s core business problem did these ideas become the reality that they are today.

What is the Business Problem?

Notice in each of the use cases I’ve identified above, each one is associated with a specific business need, goal, or hypothesis.

This is absolutely important for any product manager to avoid the temptation of the tech enthusiast who marvels in the details of the data science or ML for intellectual reasons without clearly identifying the problem or business need potentially using up valuable technical resources with no business impact.

At the end of the day, product managers need to properly connect a business problem to a data machine learning solution. We want to avoid having a solution that’s chasing for a problem, otherwise, the project will lose momentum within the company: engineers won’t be clear what their North star is, stakeholders across the organization won’t buy-in and allocate the necessary resources to make the project a success, etc.

Make sure there is a problem to which an AI solution can be directly connected

Machine learning is a potential AI solution but we need to first define the problem before prescribing that solution.

What’s the business result we are trying to achieve with ML? Because this core business need is what drives the parameters of the ML models used, what data is collected and processed, etc. We don’t do ML to provide personalization just because it’s interesting tech we need to link it to a business problem.

Yes, that would be a pretty awesome use case leveraging natural language processing to understand your post-episode commentary in context. In addition to NLP, this use case uses text to voice personalities as well as sentiment analysis of how thousands of others felt about what happened in that episode, or how they feel about a certain character. Indeed, this is a beautiful merging of multiple cutting edge technologies in one use case.

If a pilot MVP version of this showed that users who engaged with his new feature stayed longer or came back more often or helped drive more word of mouth about Netflix, then it could warrant further resources. The initial decision to build that MVP would depend on the strategic decisions made by stakeholders, not necessarily prioritized by metric. That will depend on company strategy.

But as beautiful of a user scenario the above is, what problem does that solve?

How does it relate to Netflix’s main problem of keeping users subscribed every month? If it’s related, what evidence qualitative or quantitative do we have to support that relationship?

And if this is a legitimate solution to that problem, is there a simpler version of this solution that could equally accomplish that problem but be less technically complex? For example, instead of voice input and voice output, how might the complexity of just text input and text output affect the level of effort and impact on user engagement?

What if a conversational AI interface without the voice part achieved 80% of the intended user engagement but only required 40% of the development effort? Would it be worth considering such an alternative route?

What business impact would such a solution have in comparison to the level of effort? How does this ratio compare with that of other competing tasks in the backlog?

These are all product-focused questions that a PM should be asking in order to align technology solutions with business needs. Because ultimately, it’s the business need that drives the parameters of an ML model, not the other way around.

Movie Recommendations: Identifying the Problem

Here the problem is that Netflix has a huge collection of content that is constantly changing and can be overwhelming for a user to consume. Users don’t want to be frustrated in finding content relevant to their interests. So then, what is the best way to allow each user to consume that data in a way that ultimately maximizes subscription loyalty?

Product Goals include:

Increase or maintain viewership in terms of minutes consumed,
Increase in of titles explored, frequency of logging back in
Exceeding whichever minimum threshold that the company determines is a success metric
The overall increase in monthly subscription loyalty or decrease in subscriber cancellations.

Personalized Image Thumbnail or Artwork: Identifying the Problem

This use case is a subset of Movie Recommendations. Given that movie recommendations are provided to the user, we now have yet another business or user problem.

Problem: How do we best present that movie recommendation to the user in a way that maximizes viewership and monthly subscriber loyalty?

Well, one way to provide that recommendation is through an image thumbnail but what kind of thumbnail do we provide? And how confident are we that tweaking an image thumbnail will affect viewership or subscriber loyalty in a positive way?

And how important is that thumbnail? Do we have data for that?

Gathering Data to Support That Hypothesis

Well, you can be assured that some product-focused individual at Netflix at a time prior to 2014 was asking these exact same questions internally. And that individual or group worked together to put together user studies or data elsewhere, to prove that there was indeed a strong link between an image thumbnail and viewership.

That was their hypothesis: that adjusting the artistic content of an image thumbnail could have a strong link to viewership.

Thumbnails Are Important. But What Exactly Do We Tweak?

And how does an unstructured data set like a bunch of image thumbnails get fed into a digital/mathematical machine learning model? We’ll answer this second question further below.

First, given how important the thumbnail was to a user’s decision to watch something, how can Netflix generate better thumbnails for each user to increase the chance that a user will watch a video?

Using the movie’s original art as the only thumbnail used for every single person most likely won’t yield the highest click rates. The business is likely leaving clicks on the table!

Which actor character should be on that thumbnail, if any? How many? Which auto-generated frame or poster variation would be most enticing for a particular user to click on? What lighting works best? Filters?

What data do we have on other users’ past clicking behavior can we draw associations from to help inform this thumbnail decision at scale?

Increase click-thru-rates of movie recommendations signifying engagement
The hypothesis that higher engagement rates will lead to higher subscriber satisfaction and loyalty

So this is a really interesting problem with the image thumbnail that can have a huge impact on the likelihood that someone will click on a video and watch.

Product Considerations In Personalized Image Thumbnails

We won’t dive into each of the use cases above, but let’s dive a little further into the second one: Artwork / Thumbnail Personalization

This is a data-driven personalization feature that sits on top of the Movie recommendation engine

Product Considerations

Algorithms are great, but they do have limitations. A product manager should always think ahead of possible edge-case scenarios in which the algorithm may fail to produce the best results.

Each movie should ideally have a personalized thumbnail that maximizes clicks. Since Netflix has data on clicking behavior of other people with similar interests, it is a reasonable hypothesis to guess that if other people with similar interests and watch history had a high click-thru rate on a certain thumbnail, then it is likely that this image thumbnail will perform will on a new person who hasn’t yet been recommended this movie or thumbnail.
The personalized thumbnail should take into consideration other movies there are being recommended at the same time and what those image recommendations are. Let’s say Netflix is recommending 2 different Spiderman movies to a user side by side and they both have Spiderman facing the camera mask off. One is Tobey Maguire and the other is Andrew Garfield. Wouldn’t it be weird for the user to see both portraits of Maguire and Garfield as Spiderman with their masks off side by side? Something to account for if that ever were to occur. One image thumbnail could work well in isolation, but that may not be good enough when a page of a dozen thumbnails shows up. If they are all optimized to look the same way, then as a group, each one may seem less compelling. So looking at each thumbnail together with what else is being presented will be important.
Data is great, but watch out for algorithms that do their job too well, resulting in unintended consequences or false positives! In statistics, they call this a Type I error falsely suggesting an image thumbnail that shouldn’t be suggested.

Case in point: Just look at the example below of Like Father, a movie starring Kristen Bell. Yet, Netflix’s algorithm made false thumbnail recommendations of supporting black actors who don’t really represent what the movie was about but did experience a higher click rate among certain ethnic audiences.

What Data Do We Have?

There are 2 parts to this: What data does Netflix use to create these personalized thumbnails or artwork?
What data does Netflix use to target these custom-created thumbnails to the appropriate individual?

For the first question, consider that

A 1-hour episode of Stranger Things has >86,000 static video frames
These video frames can each individually be assigned certain attributes that are later used to filter down to the best thumbnail candidates through a set of tools and algorithms called Aesthetic Visual Analysis. This is designed to find the best custom thumbnail image out of every static frame of the video
Netflix Annotation: Netflix creates metadata for each frame including brightness (.67), * of faces (3), skin tones (.2), probability of nudity (.03), level of motion blur (4), symmetry (.4)
Netflix Image Ranking: Netflix uses the metadata from above to pick out specific images that are the highest quality and most clickable

For the second question of what data Netflix uses to identify who to target these custom-generated thumbnails towards, consider that Netflix tracks:

* of movies watched, # of minutes of each show watched
% of completion for every video/series
* of up-votes, which movies were favorites, etc
% of overall watch content that is attributable to any specific show any seasonal or weekly trends related to a user’s level of engagement, etc.

How Netflix Uses Data to Construct A Universe of User Profile Interests

Well, they use it to put together a 360 profile of each user and mathematically index every user according to hundreds, possibly thousands of different attributes.

They do this in order to try to group people with similar interests together so they can use data from one user to help predict the likely behavior of other similar users.

How does this grouping of similar user profiles work and how does a product manager make sense of the data?

Having gone through the complex math and algorithms associated with matrices, vectors, and n-dimensional feature analysis, I found the easiest way to understand how this works is through a 3D-spatial representation of 10+ dimensions.

Here’s a screenshot I took when using Google’s Tensor-board on the mints database of handwritten digits. It’s a fancy plot called the t-SNE plot effectively a 3D representation of a lot more dimensions than just 3. In this case, we are showing 10 dimensions on a 3D sphere-like coordinate system.

Each hand-written digit’s position in this spatial representation can be described by a vector a coordinate-like series of numbers across however many feature dimensions.

Likewise, with Netflix users, each user profile’s position in the above chart could be described by numerical values each representing an individual dimension of that user’s interest including movie genre, favorite actors or actresses, movie topic, etc.

Reimagining Netflix Users in Mathematical Relation To Each Other

Let’s pretend in the digits diagram above that:

“6” = romantic comedy
“4” = thriller

If a user is labeled a “6” by Netflix, then he/she will be placed in the general vicinity of where all the other turquoise 6’s are in the above spatial representation.

Likewise, if a user is labeled a “4” by Netflix, then he/she will be placed in the general vicinity of where all the other magenta 4’s are in the above spatial representation.

What Did Netflix Learn From All This Data?

Now that we know how Netflix turns images into numbers in a machine learning model, what are some insights Netflix has found from all the data processing and A/B tests they have conducted for so many years?

Well, besides learning the millions of individual thumbnails that converted users to loyal subscribers over time, here are a few additional things Netflix has learned for what works in terms of thumbnails:

Show close-ups of emotionally expressive faces
Show people villains instead of heroes
Don’t show more than three characters

Netflix Deployed AI in the Right Way. Let’s Learn From Their Approach.

Netflix has done a phenomenal job of applying AI, data science, and machine learning the “right way” using a product-based approach that focuses on business needs first, then AI solution next, rather than the other way around.

When applied properly, AI can do wonders.

We’ve seen how effective AI solutions can be in personalizing the experience for the benefit of both Netflix in terms of subscriptions and users in terms of overall satisfaction.

We’ve also seen limitations of algorithms that “overdo it” and discussed specific examples in which the Netflix algorithm presented misleading thumbnails to people of color because the algorithm optimized for clicks, effectively “tricking” the users into clicking bait. This happened even when that thumbnail did not accurately represent that video.

No algorithm will be perfect in accounting for all the nuances of a human experience. In fact, algorithms designed to exploit metrics will do just that so it is the role of the product manager to work with design or other team members to find ways to address these deficiencies in algorithms.

Going forward, the integration of AI in society as well as in the corporate enterprise space will continue to become more and more prevalent.

Technologists may have a tendency to prescribe existing AI solutions, but really the most effective way to adopt AI is the way Netflix did from a business-driven perspective first.

Dig deep and you will see that Netflix generated supporting data before making the strategic move forward.

As the world of AI, data science, and machine learning continues to grow, we product managers can all take a lesson or two out of the Netflix playbook when it comes to properly deploy AI solutions.

There is no expert who can remain an expert without sharing their knowledge. So, keep sharing your knowledge with everyone.

GyamaTech | Blog

How Netflix Uses AI, Data Science, and Machine Learning

Thumbnails Are Important. But What Exactly Do We Tweak?

Product Considerations In Personalized Image Thumbnails

How Netflix Uses Data to Construct A Universe of User Profile Interests

Reimagining Netflix Users in Mathematical Relation To Each Other

What Did Netflix Learn From All This Data?

Netflix Deployed AI in the Right Way. Let’s Learn From Their Approach.

Post a Comment

How To Choose A Programming Language?

How to Overclock the CPU?

Internet Of Things

Best Free Hosting Providers

Learn About Internet Cookies